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) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ACAAACAGGA AGGACAGCAG GCTCTGGCAG CCAGAAGCCT GTGGCCCCAA GCTGGCAGGA 60 

TGGCCCCCTT CCTGCAGGTC CCCCACAGCC TTCTGGGTTC CTGACACGAG AGAAGAGGTG 120 

GGGCGGGGTG AAGTGAACTC TGAAGCCAAA ATGTGACTCT CCTGGGGTCA CCAGCTTGGG 180 

GAGAGGTGAA GAAAGATGCC GGGGCGGAAA CAAAGGGGCA GATATCACTA TGGTTATCTT 240 

ACTAAGCACA GAGTAACTGA AAAAGCAAGG GTACCGCTGC CCACCTCGTG CCCACCTTAC 300 

GTTATACCTC AAACCAGCTA GATAGTTTCT GATGGCACCC ATACCCTCCC TTCCCCTTTA 360 

GGCATTGCGC AAGCTCTCCA CCACAATCTG GAAGTTATAC CCTGCGAGGG GATGGGCAGG 420 

GCACTTCTGA GGTGCCAATC AGCCTGCACT CGCCTCTGCC CTGGCCATGG CACTGCTGTC 480 

AGTTTCTTGG TACCTGTCTC AACAGCAGCC TTGTCACGTG AGACTATGGC TGGCGGTGGG 540 

GqipGGGGCA GGAATCCTAG AAGCACAGGA GTGACATAGG GTCGGGTCGG GCAGAGCGAA 600 

GT^TAGGAGG TGATCCCCAA AGGGATGCTG GGGACGATCT GGCCAACACT GTCCTCCCAT 660 

TCAAAACTCC CAGTCTGGAG CTCTGGGACA TGGACAAGCC AGGCCTGCTA TTCTCCATAC 720 

AcSgCTCCAT AGTGTCTGGC TCAGCAGAGT GGGGGATCTG GTGGGGATGG AGGAAGCTTA 780 

GC^AAAAGCT TTGTATAGGC TGAAGCTCTG AGTGACCCTG CTGGGCCACC CTACCCTGGT 840 

CbIgGCTGGG TCATTGCATC CCCAGATTGG AAGGCTTGGT GAGATGGAGA GGAACCTTGG 900 

;1I 

CMCAAGCTA TAGCTTTGCC CACCAGAGCC TGCTGGAGGG GAATCAAACA AGCCTGGACC 960 

T^|.GGCTGGG ACTAGCTTTC CTGTTTCTGG AGTGGATGCC AACCCCCTGC CCACCAGCCT 102 0 

GCCTGTCCAC GCCAGGGACA CACAGACTCC TTCCCTTTCC AGACTGGAAA GCCCCCTCCT 1080 

GGGAGAGCAG GAAGGAAGCA ACCTGCAACT CTTCCAGCCC TGGACCTTGG GCTGAACCTA 1140 

CAGTTCAAGG TTTGTATGCT CACAGGTCTT GGCAGGGAAA GATAAGAATC CCCAGGGCAC 1200 

CCTCCCCCCC GCCCCCCAGT CCACTGCAGG TAGCTCCTGG GTCTGCCCTT CAGGGCAAGT 1260 

GCTGACGCTC CATCAGACTG TGATGGGGCC CTTTTCTGAG GATGACAATT CTGAGAACAA 1320 

GGCATTTTTC TAGAGGTGGC AGAACAGCAT TTTGTGATGC CCGAGGATCT GGGAGCACAG 1380 

GTCCAGCTTA ATGAGGGATT GGAGGAAGTG GGTATCATCA TTACAGGGAG GGGCCTCTGT 1440 

GGCCTCCTGG GAAAATGCAG TTGCTCTCTT TGGGTGGCCT GGGGTTGTGT GGTGGGCAGA 1500 

GGACGGAGGT GCTCATTGGG GGAAGGGATC ACTTCTGCTC AGAGTGCTCG CAAGGGCCTT 1560 

TCCTTTTCCT GAAGGCAAGC AGGCCTCCTC CTCCTCCTCT TCCTCCTTCT CCTCTTCCTC 1620 

CTCTTTCTCC ATATGCCTAG CTGGTCATTT CTAGGGACCA GCATGGTTGG GAAGGGGGCC 1680 



TTGTCTTGGC CTTCCTCTTG TCTCAATTCC CTCTTTGAGC AGAAGACGGG GTGGGTGGGG 1740 

TAGGATTGGA TAGTGGTTGA TGCCAAAGAT TGA^GGGGTA GGGCGGGGCA GAAGTGGGAA 1800 

GGTCCCTGGC TTCCTCACCT TGGTAGATGG TGAGGAGCCC CAGAGGTTGA GCTGAGCAGC 1860 

AGCTGTGATT TCAGGGTGCC TCTGTTGGAG AGGCTGCTGT GATTTGAAAA TCTTCTTTCC 1920 

TTGGTGACAA TTCCAGAAGG CTCCAGATGA ATTGTATTGG TGAGTGCCTG GCCCTTAAGC 1980 

AGTCCCAGCT GGGGATGATG GGGATTTATG GGTGTCCCTG AGCCTAGGGT GACAGGGCCT 2 040 

CTCCTTTTTT TTTTATTCTG CTTCAGGGTA CCACCCCACC AGGAGGCTGC GGGCCTGGGG 2100 

CGGCCTAGCT GGAGGAGCAA CATTCATGGT AATTTGGTTT TTCTGGCTGT GGGGATCAGC 2160 

TCCTGGAAGT GCCCCTGTGC CTCAGTCCAC ACTCACCATC CTTATCTGGC ACTGGCCTTT 2220 

CACCAACCGG CCGCCAGAGC TACCTGGTGA CACCTGCACT CGCTATGGCA TGGCCAGCTG 2280 

CCGTCTGAGT GCTAACCGGA GCCTGCTAGC CAGTGCTGAT GCTGTGGTCT TCCACCACCG 2340 

TGAGCTGCAA ACCCGGCAAT CTCTCCTACC CCTGGACCAG AGGCCACACG GACAGCCTTG 2400 

GGTCTGGGCC TCCATGGAAT CGCCCAGTAA TACCCATGGT CTCCATCGCT TCCGGGGCAT 2460 

CTTCAACTGG GTGCTGAGCT ATCGGCGTGA TTCAGATATC TTTGTACCCT ACGGTCGCTT 2520 

GGAGCCTCTC TCTGGGCCCA CATCCCCACT ACCGGCCAAA AGCAGGATGG CTGCCTGGGT 2580 

GATCAGCAAT TTCCAGGAGC GGCAGCAGCG TGCAAAGCTG TACCGGCAGC TGGCCCCTCA 2640 

TCTGCAGGTG GATGTGTTCG GTCGCGCCAG CGGACGGCCC CTATGCGCTA ATTGTCTGCT 2700 

GCCCACTTTG GCCCGGTACC GCTTCTACCT GGCCTTTGAG AACTCACAGC ATCGGGACTA 2760 

CATCACTGAG AAGTTCTGGC GCAATGCCCT GGCGGCTGGT GCTGTACCCG TGGCGCTGGG 2820 

ACCTCCTCGG GCCACCTACG AGGCTTTTGT GCCACCAGAT GCCTTTGTAC ACGTGGACGA 2880 

CTTCAGCTCT GCCCGTGAAC TGGCTGTCTT CCTCGTCAGC ATGAATGAGA GTCGTTATCG 2940 

TGGCTTCTTT GCTTGGCGAG ACCGGCTCCG TGTGCGGCTC CTGGGTGACT GGAGGGAGCG 3 000 

CTTCTGCACC ATCTGTGCCC GCTACCCTTA CTTGCCCCGC AGCCAGGTCT ATGAAGACCT 3 060 

TGAAAGCTGG TTCCAGGCTT GAACTCCTGC TGCTGGGAGA GGCTGGATGG GTGGGAGACT 3120 

GATGTTGAAA CCAAAGAGCT GGGCATCCAG GCTTTTGGTC ACCATGGCAC TACCCCAAGG 3180 

CTTTTCCTGT TCAGTGAGCA GGAATTCAGG ATATAAGGAG AAGACTGGGC TGAGATACCC 3240 

TGGTGGGCTT TAGAGTAGGG GCCCAGGATA AGAGACAATG AATTAATGAG GAGCATATGG 3300 

GGAAGGTGGC TGAGGGTCCC TGACTTACCT TGACCCATGG CTGAAGGCTC CATGCCCATG 3 3 60 

GCTGGAGCTG GGACCCTACA CTTCTATAGT CAAGGTGCTT AGCCTCAAGG TTGCAGATGC 3420 



ACCCTCTAGT ACTCTGGGTG CAGACTGTAC ACTGGGCGCA GGGGGTTGTG GAAGGACAGT 
GCAGATGATT CTGGGCTTTT GACACCACAG TTCCCCCAGG GAAAGAGGCA CTACTAATAA 
AAACACTGAC AGAAATCTCC TGGTCAAGTC TGTTAGGCAG CAGAGCTCGA ATTC 



3480 
3540 
3594 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Thr Pro Cys Pro Pro Ala Cys Leu Ser Thr Pro Gly Thr His 
15 10 15 

Arg Leu Leu Pro Phe Pro Asp Trp Lys Ala Pro Ser Trp Glu Ser Arg 
20 25 30 

Lys Glu Ala Thr Cys Asn Ser Ser Ser Pro Gly Pro Trp Ala Glu Pro 
35 40 45 

Thr Val Gin Met Asn Cys lie Gly Tyr His Pro Thr Arg Arg Leu Arg 
50 55 60 

Ala Trp Gly Gly Leu Ala Gly Gly Ala Thr Phe Met Val lie Trp Phe 
65 70 75 80 

Phe Trp Leu Trp Gly Ser Ala Pro Gly Ser Ala Pro Val Pro Gin Ser 
85 90 95 

Thr Leu Thr lie Leu lie Trp His Trp Pro Phe Thr Asn Arg Pro Pro 

100 105 110 

Glu Leu Pro Gly Asp Thr Cys Thr Arg Tyr Gly Met Ala Ser Cys Arg 
115 120 125 

Leu Ser Ala Asn Arg Ser Leu Leu Ala Ser Ala Asp Ala Val Val Phe 
130 135 140 

His His Arg Glu Leu Gin Thr Arg Gin Ser Leu Leu Pro Leu Asp Gin 
145 150 155 160 

Arg Pro His Gly Gin Pro Trp Val Trp Ala Ser Met Glu Ser Pro Ser 
165 170 175 



Asn Thr His Gly Leu His Arg Phe Arg Gly lie Phe Asn Trp Val Leu 
180 185 190 



Ser Tyr Arg Arg Asp Ser Asp lie Phe Val Pro Tyr Gly Arg Leu Glu 
195 200 205 

Pro Leu Ser Gly Pro Thr Ser Pro Leu Pro Ala Lys Ser Arg Met Ala 
210 215 220 

Ala Trp Val lie Ser Asn Phe Gin Glu Arg Gin Gin Arg Ala Lys Leu 
225 230 235 240 

Tyr Arg Gin Leu Ala Pro His Leu Gin Val Asp Val Phe Gly Arg Ala 
245 250 255 

Ser Gly Arg Pro Leu Cys Ala Asn Cys Leu Leu Pro Thr Leu Ala Arg 
260 265 270 

Tyr Arg Phe Tyr Leu Ala Phe Glu Asn Ser Gin His Arg Asp Tyr lie 
275 280 285 

Thr Glu Lys Phe Trp Arg Asn Ala Leu Ala Ala Gly Ala Val Pro Val 
290 295 300 

Ala Leu Gly Pro Pro Arg Ala Thr Tyr Glu Ala Phe Val Pro Pro Asp 
305 310 315 320 

Ala Phe Val His Val Asp Asp Phe Ser Ser Ala Arg Glu Leu Ala Val 
325 330 335 

Phe Leu Val Ser Met Asn Glu Ser Arg Tyr Arg Gly Phe Phe Ala Trp 
340 345 350 

J^rg Asp Arg Leu Arg Val Arg Leu Leu Gly Asp Trp Arg Glu Arg Phe 
355 360 365 

Cys Thr lie Cys Ala Arg Tyr Pro Tyr Leu Pro Arg Ser Gin Val Tyr 
370 375 380 

Glu Asp Leu Glu Ser Trp Phe Gin Ala 
385 390 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCGCGGATCC CACCATCCTT ATCTGGCACT GGCCTTTCAC C 



41 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCGCGGATCC AGTTCAAGCC TGGAACCAGC TTTCAAGGTC CTTC 



44 



